Using Distributed Database Technology to Simplify the ETL Component of Data Warehouse
نویسندگان
چکیده
The increasing need for Decision Support Systems for business enterprises has lead to tremendous growth of Data Warehouses. Within large enterprises number of data sources is increasing; due to which Data Warehouses are getting more and more complex. The ETL componentone of the basic components of data warehousecan be made simpler with the use of Distributed Database technology in the development of data ware house. This will also result in the establishment of an infrastructure for information sharing within the enterprise.
منابع مشابه
ETL Extract , Transform and Load ( ETL ) Performance Improved by Query Cache
Extraction, Transformation, and Loading (ETL) processes are responsible for the operations taking place in the back stage of a data warehouse architecture Extract, transform and load (ETL) is the core process of data integration and is typically associated with data warehousing. ETL tools extract data from a chosen source, transform it into new formats according to business rules, and then load...
متن کاملXML based Framework for ETL Processes For Relational Databases
In Data Warehousing, Extraction-Transformation-Loading (ETL) are the key tasks that are responsible for the extraction of data from several sources, their cleansing, customization and insertion into data warehouse [10]. More specifically ETL tools are category of specialized tools with the task of dealing with data warehouse cleaning and loading problems. These task are very critical in every d...
متن کاملEtl Workflow Generation for Offloading Dormant Data from the Data Warehouse to Hadoop
The technologies developed to address the needs of Big Data have presented a vast number of beneficial opportunities for use alongside the traditional Data Warehouse (DW). There are several proposed use cases for using Apache Hadoop as a compliment to traditional DWs as a Big Data platform. One of these use cases is the offloading of "dormant data" that is, infrequently used or inactive data fr...
متن کاملOn Handling the Evolution of External Data Sources in a Data Warehouse Architecture
A data warehouse architecture (DWA) has been developed for the purpose of integrating data from multiple heterogeneous, distributed, and autonomous external data sources (EDSs) as well as for providing means for advanced analysis of integrated data. The major components of this architecture include: an external data source (EDS) layer, and extraction-transformation-loading (ETL) layer, a data w...
متن کاملInteroperable Distributed Data Warehouse Components
Extraction, Transformation and Loading (ETL) are the major functionalities in data warehouse (DW) solutions. Lack of component distribution and interoperability is a gap that leads to many problems in the ETL domain, because these ETL components are tightly-coupled in the current ETL framework. Furthermore, complexity of components extensibility is another gap in the ETL area, because of the sa...
متن کامل